

IDS No.: 2000-0509 



SYSTEM AND METHOD 



FOR ANALYZING COMMUNICATIONS 



This non-provisional application claims priority from U.S. Provisional Application No 
60/168,242 filed December 1, 1999 and titled "ContactMap: Designing Our Digital Lives" the 
entire disclosure of which is incorporated by reference herein in its entirety as if folly reproduced 



BACKGROUND OF THE INVENTION 

The present invention relates generally to communications, and more particularly, to a 
system and a method for analyzing a user's collection of communications, such as electronic 
mail messages, telephone messages, voicemails, instant messaging dialogues and telephone logs 
to determine the identity of contacts within the communications, the relative priority of the 
contacts within the communications and any relationships between the contacts themselves and 
between the contacts and the user. 

The ever-growing variety of mass communication devices and services such as wireless 
telephones, pagers, portable computers, personal digital assistants (PDAs), set top boxes, 
electronic mail, and instant messaging among others, has exponentially increased the number of 
different types of communications that one may receive on a daily basis. As a result, it has 
become very difficult to track one's personal and professional contacts since after even a short 
period of time, one could have hundreds of unique communications, such as electronic mail 
messages and voicemails to sort through. 

Accordingly, it would be desirable to be able to easily analyze a collection of 
communications, such as electronic mail messages, to determine the identity of one's personal 
and professional contact, the relative priorities of these contacts and any existing relationship 
between and among these contacts. 



herein. 
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SUMMARY OF THE INVENTION 

The invention provides a method for processing user communications to determine 
identities of contacts within the communications, to prioritize the relative importance of the 

contacts identified and to establish any relationships between the contacts themselves and 
5 between the contacts and the user. In one embodiment, the method includes the steps of 

receiving a plurality of communication files, wherein the communication files include a plurality 
of different contacts, reviewing the plurality of communication files to determine contact 
identities, contact priorities and relationships and providing an analysis of the communication 
files, wherein the analysis provides the identities of contacts identified in the communication 
10 files, the relative importance of contacts relative to one another and any relationships between 
two or more contacts. 

H In the present invention, a contact may be refer to an individual person, a company, a 

ry 

m group of individual people, an organization, an electronic mail listing or any combination or 
iSr variations of the aforementioned. Contacts may be related to a user's personal life, workplace or 
rU other social networks. Typically, the user will specify a directory, folder or other repository 
L which contains the communications, such as the electronic mail messages to be analyzed. A 
I s * communications or message processor will receive the communications and from these 
Q communications, determine at least the identity of the contacts and their relative priority to one 
2<3i another to the user. For example, contacts which engage in frequent, reciprocal communications 
with the user are rated or assigned a higher priority than contacts which only engage in one-way 
communications with the user. Though the sheer number of communications a contact may have 
with the user is relevant in determining the contact priority, the reciprocal nature of the 
communications is more indicative as to the importance of the communications. 

25 

In one embodiment of the present invention, contacts which are frequently mentioned or 
"co-mentioned" within communication may also be classified as being related to a similar 
grouping or network in relation to the user. For example, co-workers who frequently copy each 
other on electronic mail message may be considered to be part of a single grouping of contacts. 
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BRIEF DESCRIPTION OF THE DRAWINGS 

FIG. 1 illustrates an exemplary system in accordance with the teachings of the present 
invention. 

FIG. 2 illustrates an exemplary method in accordance with the teachings of the present 
invention. 

FIG. 3 illustrates another exemplary method in accordance with the teachings of the 
present invention. 

FIG. 4 illustrates yet another exemplary method in accordance with the teachings of the 
present invention. 

FIG. 5 illustrates still another exemplary method in accordance with the teachings of the 
present invention. 

FIG. 6 illustrates an exemplary message processing display in accordance with the 
teachings of the present invention. 

FIGS. 7a-7d illustrate an exemplary code implementation of a sorting method in 
accordance with the teachings of the present invention. 

FIG. 8a-8b illustrate an exemplary code implementation of a clustering method in 
accordance with the teachings of the present invention. 
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DETAILED DESCRIPTION OF THE INVENTION 

The present invention provides a system and method for analyzing a user's 
communications, such as electronic mail messages, voicemail messages, instant messages, 

5 telephone logs and other such communications. The communications are analyzed to determine 
the identity of any contacts within the communications, the priority of the contacts within the 
communications and any inferred relationships between the contacts themselves and between the 
contact(s) and the user. As used herein, the term contact is used to refer to an individual person, 
a company, a group of individual people, an organization, an electronic mail listing or a 
10 combination of the aforementioned with which a user may communicate and/or receive 
communications from. For example, a user's colleagues in a work environment may be 
considered contacts as well as members of a user's family may be considered contacts. Contacts 

u 't may also be part of and classified into contact groupings, such as related contacts in a family 

ly m grouping or colleagues in a work grouping. 

m 

rf Referring to FIG. 1, a system 10 of the present invention includes a communications 

5 U processor 20 which analyzes a user's communications, such as the user's electronic mail 
U messages 30, voicemail messages 40, instant messages 50, telephone logs 60 and/or other 
H communications 70 to determine contact identities, priorities and relationships. In a preferred 
203 embodiment, the communications processor 20 may be configured as specialized software which 
q implements the methods disclosed here and may be run on a computing device, such as a 
personal computer, personal digital assistant, wireless phone or other similar device. Such 
specialized software which implements the methods described herein may also be used in a 
network device, such as a network server or electronic message server, such as an electronic mail 
25 message server. In the present invention, the communications processor 20 will receive and 

analyze the communications, such as electronic mail messages 20, and provide a contact network 
display 80 to the user. In one embodiment, the contact display 80 may include a listing of the 
identified contacts from the communications. Such a listing may be prioritized and arranged in a 
manner according to the rankings of the contacts as determined from an analysis of the 
30 communications, as shown and described later herein. 



4 
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Referring to FIG. 2, a first generalized embodiment of a process for analyzing a user's 
communications in accordance with the teachings of the present invention is shown. In this 
embodiment, a plurality of communications are accessed or extracted, step 200, such as by 
communications processor 20 previously shown in FIG. 1 . Once accessed, the communications 
5 are processed to determine the relative importance of contacts based on the communications, step 
210, such as by the communications processor 20. Once processed, a contact display 220 is 
provided to the user, step 220, which may include clusters of related contacts. The user is then 
free to approve and/or modify the contact display as the user sees fit. 

10 Referring to FIG. 3, another embodiment for analyzing a user's communication is shown 

As shown in FIG. 3, a collection of communication messages, such as electronic mail messages, 
are received from a user, step 300. In one embodiment of the present invention, the user may 
; = i specify certain folders, directories, networks or other locations in which these communication 
messages are stored. These communication messages are processed, step 310, which may 

ry 

15n include determining the amount of messages from certain specific contacts, an analysis of the 
: r: message content and/or a determination if the user and the contact have participated in mutual 
TU reciprocal communications. Once the messages are processed, a contact ranking is determined 
\ s ± based on the processing of the messages, step 320. A contact display of the user's identified 
contacts is then provided to the user, step 330. In alternative embodiments, other information 

203 may be provided to the user via the contact display, such as the priority ratings of each contact 
|=5 and any determined relationships between contacts and the user. 

In one embodiment of the present invention, the communications analysis process may be 
accomplished by initially having a user provide or specify one or more electronic mail folders to 

25 process. Once the electronic mail folders are specified, electronic mail message header 

information from the electronic mail messages are analyzed to determine a relative importance of 
entities or contacts mentioned in the messages. For electronic mail messages, the header fields, 
such as the to:, from:, cc:, and bcc: fields may be analyzed to determine which contacts are 
mentioned or referenced in the messages. In one embodiment, greater priority/weight is attached 

30 to contacts who engage in frequent, reciprocal communication (i.e., the user responds to 
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messages from the contact and vice versa), as opposed to if the user sends or receives 
communications without any reciprocity. 

Referring to FIG. 4, another embodiment of the present method for analyzing 
5 communications is shown. In this embodiment, a user may be prompted to identify a collection 
of existing communication files, such as a directory or folder of electronic mail messages, step 
400. The identified communication files are accessed, step 410, for example, by accessing a 
network directory or folder containing the electronic mail messages. Once accessed, the 
communication files are processed, step 420. Processing the communication files, step 420, may 
10 include determining the identities of contacts within the communication files, substep 422; 

determining the priorities of contacts relative to one another and/or relative to the user, substep 
424; and determining any relationships among the contacts and/or the user, substep 426. Once 
! 5 the communication files have been processed, the user's contact preferences are received, step 
: S| 430. The user's contact preferences may be in form of approvals and disapprovals of the contact 

identities, priorities and arrangements determined in step 420. For example, if a contact X is 
^ determined to be a higher priority contact than a contact Y, the user may override such a 
rU determination and assign a higher priority to contact Y even though contact X was previously 
L determined to be of higher priority. 

Ld 

203 In the present invention, it is contemplated that users can process their communication 

'"i files, such as their electronic mail messages, via the communications analysis process described 
herein at any time to create a contact network or incrementally update their contact network 
display. For example, the message or communications processing may be used to identify new 
contacts within a user's electronic mail messages and possibly add these new contacts to the 

25 user's workspace. Such an updating is best performed at some incremental time interval to 
ensure that the user's contact network contains the most updated set of contacts. 

Referring to FIG. 5, another embodiment of the present method for analyzing a user's 

communications is shown. In this embodiment, one or more contacts are identified within a 

30 communication, step 500, such as be reviewing a header portion of an electronic mail message 

and/or voicemail message. A determination is made whether the identified contact or contacts 

6 
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have engaged in frequent communications with the user, step 510. Such a determination may be 
made, for example, by simply counting the number of messages a contact either sent to the user 
and/or received from the user. If the contact or contacts have engaged in frequent 
communications with the user, the contact or contacts are assigned a higher priority, step 520. If 
the contact or contacts have not engaged in frequent communications with the user, the contact 
or contacts are assigned a relative lower priority, step 530. The threshold for what constitutes 
frequent communications may be set at any level. For example, the threshold may be set at ten 
(10) communications, such that if it is determined that a contact engaged in at least ten (10) 
communications with the user, that contact will then be assigned a higher priority. 



Referring again to FIG. 5, another determination is made to decide whether the 
communications between the user and the contact or contacts are reciprocal communications, 
! «3 e.g., if the communication from the contact is in response to an initial communication from the 
""4 user, step 540. Reciprocal communications can also be initiated by a contact and thus responses 
liS by the user to an initial communication by the contact may also be considered to be reciprocal 
l p communications. If the contact or contacts have engaged in reciprocal communications with the 

!:=S3 

fU user, the contact or contacts are assigned a higher priority, step 550. If the contact or contacts 
have not engaged in reciprocal communications with the user, the contact or contacts are 
assigned a relative lower priority, step 560. For example, if a user receives a message from a 
2Q3 particular contact, and the user does not reply to the message, the contact may be given a lower 
priority rating. Additionally, if the user continually receives message from a particular contact 
and never replies to the message, the contact may be given an even lower priority rating. 



Referring again to FIG. 5, a determination may be made based on the recency of the 

25 communication, step 570, e.g. how old the communication is relative to other communications 

being analyzed. Such a determination may be made, for example, by simply processing the date 

field in an electronic mail message header. If it is determined that the communication is a recent 

communication, the contact associated with the recent communication may be assigned a higher 

priority, step 580. If the communication is determined to be an old or a relatively older 

30 communication, the contact associated with the communication may be assigned a relative lower 

priority, step 590. The threshold for what constitutes a recent communication may be set at any 

7 
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level. For example, the threshold may be configured such that any communication that is less 
than a month old can be considered recent and any communication received that is older than a 
month may be considered not recent. Furthermore, additional temporal information such as the 
recency and duration of interactions or rapidity of responses in reciprocal interactions may be 
analyzed in determining the priority of contacts for a user. For example, contacts which engage 
in relatively longer duration communications and/or correspond with a user in rapid, reciprocal 
communications may be rated higher than contacts which do not. For example, if a contact often 
replies to a message from the user, or the user frequently replies to messages from that particular 
contact, then it may be inferred that the contact is important to the user. Another strong criterion 
for inferring importance is if the user often sends messages to the contact. Frequent attempts by 
the user to target the contact may be some indication of the contact's value. Weaker criteria are 
that the Contact often sends messages to the user, or is mentioned in (but does not originate) 
messages to and from the user. 

Referring yet again to FIG. 5, another determination is made to decide whether there are 
any co-mentions in the communications, step 592. As used herein, the term "co-mentions" refers 
to certain contacts that are referenced or mentioned within the user's communications, such as in 
their electronic mail messages. For messages or communications involving more than a sender 
and a single recipient, the group or set of contacts that are mentioned together in the same 
message will be referenced as "co-mentions". Relations are thus inferred between contacts on 
the basis of these co-mentions. For example, the more that people are mentioned together, the 
stronger the inferred relationship, as discussed in more detail later herein. 

In one embodiment, the system message processing or communications analysis 

determines whether or not contacts are mentioned in the header fields of the same electronic mail 

message. For example, if two people are repeatedly mentioned in the "to:" field of an electronic 

mail message header from a given contact, then an inference may be made that there is a 

relationship between that contact and the two co-mentioned people. This information about 

relationships has two potential benefits: (a) it allows the construction of groups, with 

straightforward addressing since instead of having to set up electronic mail aliases for groups a 

user repeatedly communicates with, these groups may be detected automatically and can be used 

8 
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for rapid addressing of messages to multiple individuals; (b) co-mentions may be useful in 
guiding contact layout, e.g. with co-mentioned contacts being near one another in the initial 
automatic layout configuration of the user's workspace display. In another embodiment of the 
present invention, co-mentions may be determined by determining whether a contact routinely 
forwards a message from another contact to the user or vice versa. In such a case, the contact 
from which the forwarded message originated, may be considered to be related to the contact 
which forwarded the message and/or also the user. Referring back to FIG. 5, if two or more 
contacts have been co-mentioned within the communications, a relationship between the contacts 
themselves and/or the user is inferred, step 594. If there are no co-mentioning of contacts within 
the communications, no relationship between the contacts themselves and/or the user is inferred, 
step 598. 

In one embodiment of the present invention, importance ranking of contact(s) may be 
represented as a linear equation using the following weightings on the above criteria: 

Importance score = Kxi + LX2+ Mx 3 + Nx 4 , 

where 

• xi = number of messages the user replies to from the originating contact; 

• x 2 = number of messages the user sends to the contact excluding replies; 

• x 3 = number of reply messages from the contact; 

• X4 = combined total of messages that are not composed by the user in which the contact is 
mentioned in the cc or to lines, plus messages from the contact that are not replies; 

• K, L, M and N are constant weightings. K applies to messages involving reciprocity and is 
very large to respect the significance of reciprocal interactions initiated by the user. L is 
moderately large, based on the intuition that if the user initiates a communication with a 
contact this suggests that they attach importance to that person. M is also moderately large to 
respect the importance of contact initiated interactivity. N is small based on the proposition 
that little value is attached to messages initiated by other people that are not responses, e.g. 
some of which may be "spam". Similarly, passing mentions of a contact name in messages, 
to, and from, others may not be highly rated. 
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Referring now to FIG. 6, an exemplary contact network display or listing 600 is shown. 
After the communications analysis or processing as described above is performed, the user may 
be provided with such a display or listing 600. In one embodiment, the display 600 includes a 
5 name section 610, an address section 620, an oldest message date section 630 and a newest 
message date section 640 and a score section 650. An approval or "include" section 660 may 
also be provided to the user, which allows the user to include or exclude any number of contacts 
from the user's contact network. 

10 In the present invention- any number of scoring schemes may be used to rate or prioritize 

contacts based on the communications processing discussed herein. For example, a 1 to 300 
scale may be used, a 1 to 10 scale may also be used as well as an A-Z rating scheme. Rating 
l= :f points or allocations may be provided to a contact in also a number of different manners. For 
HI example, a contact may be assigned or awarded a single point for every piece of communication, 
lgS i.e. electronic mail message, voicemail message, instant message, etc. that the contact 
'*2 participated in with the user. For example, if a certain contact sent ten electronic mail messages 
rU to the user, the contact will be awarded 10 point under this exemplary scoring scheme. The 
contact's cumulative score may be adjusted by a number of factors, such as by increasing the 

:~ m score if the contact and the user engaged in reciprocal rather than one-way communications. The 

Ly 

203 term "one-way" communications being defined as if the contact sent the user certain 

.ret 

!=i communications, but the user never responded to the communications, such as may be the case 
in terms of "SPAM" or unsolicited electronic mail. Using another exemplary rating or scoring 
scheme, the contact score may be modified based on certain portions of the communication's 
content, for example, if the content of a message is determined to have many informal, slang, or 
25 otherwise colloquial uses of language in the message content, the contact may be considered to 
be a personal rather than a business or professional contact and then rated higher or lower 
accordingly depending on the preferences of the user. 

In implementing the teachings of the present invention, it is preferable to develop the 
30 system and method herein with an object-oriented structure which may be written in an object- 
oriented language such as Java. One exemplary implementation may include a set of higher- 

10 
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level user interface components on top of an underlying persistent contact-based data layer and 
message analysis component. Data objects may be strictly separated from their display to 
allow multiple display components to display the same underlying data. The Java delegation 
event model may be used to allow user interface components to easily track changes to 
5 underlying data objects. Java object serialization may also be used to persistently store contact 
information. Referring to FIGS. 7a-7d, an exemplary code listing is shown which may be used 
to prioritize contacts as discussed herein is shown. Referring to FIGS. 8a-8b , an exemplary 
code listing is shown which may be used to cluster contact in accordance with the teachings 
herein. 

10 

It will be apparent to those skilled in the art that many changes and substitutions can be 
made to the system and method described herein without departing from the spirit and scope of 
S3 the invention as defined by the appended claims. For example, natural language analysis 
%j techniques and programs that analyze, for example, electronic mail messages, may be used. 
\$i Additionally, voice recognition analysis techniques and programs may also be used to analyze 
! =Q the communications, such as voicemail messages. For example, voice recognition analysis may 
ill be used to convert voicemail messages into text. The text may then be processed to extract 
certain contact information from the text, such as contact names, numbers, etc. 
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We claim: 

1 1 . A method for analyzing one or more user's communication messages which contain 

2 communications from a plurality of user contacts, the method comprising: 

3 accessing a plurality of communication messages; and 

4 processing the plurality of communication messages, wherein the processing of the 

5 plurality of communication messages includes: 

6 determining the identity of the plurality of user contacts; 

7 assigning rankings to each of the plurality of user contacts based on the plurality 

8 of communication messages; and 

9 determining relationships which may exist between contacts and between the one 
10 or more users and the contacts user/contact relationships. 

i4 2. The method of claim 1, wherein contacts which engage in frequent communications with 

in the user are assigned higher rankings and contacts which engage in less frequent 

3 y communications with the user are assigned lower rankings. 

4U 

3. The method of claim 1, wherein contacts which engage in more recent communications 

|" with the user are assigned higher rankings and contacts which engage in less recent 

i s = 
':==? 

3:3 communications with the user are assigned lower rankings. 

it 

1 4. The method of claim 1, wherein the communication messages are at least one of 

2 electronic mail messages, voicemail messages, instant messages, telephone logs and facsimiles. 

1 5. The method of claim 1, wherein processing the plurality of communication messages 

2 includes: 

3 assigning higher priority to contacts who engage in reciprocal communications with the 

4 user. 
5 

12 
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1 6. The method of claim 1 , wherein processing the plurality of communication messages 

2 includes: 

3 determining if a contact has engaged in reciprocal communications with the user. 
4 

1 7. The method of claim 6, further comprising: 

2 assigning higher priority to contacts who engage in frequent reciprocal communications 

3 with the user. 
4 

1 8. The method of claim 1 , wherein processing the plurality of communication messages 

2 includes: 

3*% inferring relationships between two or more contacts based on the mentioning of the 

40 contacts within the communications. 

sy 

j ~ 9. The method of claim 1 , wherein the relationships are inferred between contacts based on 
the mentioning of contacts within one or more communication message headers. 

ru 

jL 10. The method of claim 1 , wherein processing the plurality of communication messages 

#J includes: 

3~3 employing voice recognition techniques to convert voicemail messages to text in order to 

■ S!= 3 

4^ extract contact information from the communication messages. 
5 
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1 11. A method for processing user communication files to determine identities of contacts 

2 within the communication files, to prioritize the relative importance of the contacts identified and 

3 to establish any relationships between the contacts, the method comprising: 

4 receiving a plurality of communication files, wherein the communication files include a 

5 plurality of different contacts; 

6 reviewing the plurality of communication files to determine contact identities, contact 

7 priorities and relationships; and 

8 providing an analysis of the communication files, wherein the analysis provides the 

9 identities of contacts identified in the communication files, the relative importance of contacts 
10 relative to one another and any relationships between two or more contacts. 

,1 12. The method of claim 1 1 , wherein reviewing the plurality of communication files 

'S-3 

includes: 

L |: determining whether the communication files are unsolicited communications or 

|# responsive to communications initiated by the user. 

J 

ff 13. The method of claim 1 1, wherein reviewing the plurality of communication files 

i2J includes: 

:£ determining which contacts are co-mentioned within the communication files. 

■~" 

1 14. The method of claim 13, wherein if two or more contacts are co-mentioned with two of 

2 more communication files, the two or more contacts are determined to be related. 

1 15. The method of claim 11, wherein reviewing the plurality of communication files 

2 includes: 

3 parsing the plurality of communication files to determine whether prior reciprocal 

4 communications exists between a particular contact and the user. 
5 

1 16. The method of claim 11, wherein the communication files are electronic mail messages. 

2 14 
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1 17. The method of claim 1 1 , wherein the electronic mail messages are identified by the user. 
2 

1 18. A method for identifying a user's potential contacts, prioritizing these potential contacts 

2 and determining any relationships between these potential contacts based on a plurality of stored 

3 messages, the method comprising: 

4 analyzing the plurality of stored messages which includes: 

5 identifying the potential contacts within the stored messages; 

6 determining the relative importance of the identified potential contacts; and 

7 determining any established relationships between the potential contacts 

8 mentioned in the stored messages; and 

@3 receiving from the user approvals of the potential contacts which have been identified 

and prioritized. 

ig 19. The method of claim 18, wherein the relative importance of contacts is based on one or 

SI more of the frequency of communications between the contacts and the user, the recency of the 

3 communications between the contacts and the user and the reciprocity of the communications 

;44 between the contacts and the user. 



O 20. The method of claim 1 8, wherein the relative importance of contacts is based on the 



4 where xj = number of messages the user replies to from the originating contact; Xi = number of 

5 messages the user sends to the contact excluding replies; X3 = number of reply messages from the 

6 contact; X4 = combined total of messages that are not composed by the user in which the contact 

7 is mentioned in the cc or to lines, plus messages from the contact that are not replies; and 

8 K, L, M and N are constant weightings. 
9 



2 following equation: 



3 



Importance = Kxi + LX2+ Mx 3 + Nx 4 , 
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